Clustering Web Documents: A Phrase-Based Method for Grouping Search Engine Results
نویسنده
چکیده
Clustering Web Documents: A Phrase-Based Method for Grouping Search Engine Results
منابع مشابه
Phrase based Clustering Scheme of Suffix Tree Document Clustering Model
Document clustering is one of the difficult and recent research fields in the search engine research. Most of the existing documents clustering techniques use a group of keywords from each document to cluster the documents. Document clustering arises from information retrieval domains, and “It finds grouping for a set of documents belonging to the same cluster are similar and documents belongs ...
متن کاملEfficient Clustering of Web Search Results Using Enhanced Lingo Algorithm
Web query optimization is the focus of recent research and development efforts. To fetch the required information, the users are using search engines and sometimes through the website interfaces. One approach is search engine optimization which is used by the website developers to popularize their website through the search engine results. Clustering is a main task of explorative data mining pr...
متن کاملLingo: Search Results Clustering Algorithm Based on Singular Value Decomposition
Search results clustering problem is defined as an automatic, on-line grouping of similar documents in a search results list returned from a search engine. In this paper we present Lingo—a novel algorithm for clustering search results, which emphasizes cluster description quality. We describe methods used in the algorithm: algebraic transformations of the term-document matrix and frequent phras...
متن کاملSearch Result Clustering Method at NTCIR-5 Web Query Expansion Subtask
We use a retrieval system with search result clustering to tackle the NTCIR-5 WEB Query Term Expansion Subtask. The system clusters the search results in such a way as to make it easier for the user to select relevant documents as feedback documents. In addition, we select phrase words or named entities(NE) as query-expansion keywords from the feedback documents because these words tend to repr...
متن کاملA Tolerance Rough Set Approach to Clustering Web Search Results
Two most popular approaches to facilitate searching for information on the web are represented by web search engine and web directories. Although the performance of search engines is improving every day, searching on the web can be a tedious and time-consuming task due to the huge size and highly dynamic nature of the web. Moreover, the user’s “intention behind the search” is not clearly expres...
متن کامل